Multi-Dialectical Languages Effect on Speech Recognition

نویسندگان

Can Hurt

Mohamed G. Elfeky

Pedro Moreno

Victor Soto

چکیده

Research has shown that automatic speech recognition (ASR) performance typically decreases when evaluated on a dialectal variation of the same language that was not used for training its models. Similarly, models simultaneously trained on a group of dialects tend to underperform when compared to dialect-specific models. When trying to decide which dialect-specific model (recognizer) to use to decode an utterance (e.g., a voice search query), possible strategies include automatically detecting the spoken dialect or following the user’s language preferences as set in his/her cell phone. In this paper, we observe that user’s voice search queries are usually directed to a dialect-specific recognizer that does not match the user’s current location, and present a study that shows that automatically selecting the recognizer based on the user’s geographical location helps improve the user experience. Keywords—multi-dialectical languages; speech recognition; voice search

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic modelling for speech recognition in Indian languages in an agricultural commodities task domain

In developing speech recognition based services for any task domain, it is necessary to account for the support of an increasing number of languages over the life of the service. This paper considers a small vocabulary speech recognition task in multiple Indian languages. To configure a multi-lingual system in this task domain, an experimental study is presented using data from two linguistical...

متن کامل

Efficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network

This paper proposes a method of acoustic modeling for zero-resourced languages speech recognition under mismatch conditions. In those languages, very limited or no transcribed speech is available for traditional monolingual speech recognition. Conventional methods such as IPA based universal acoustic modeling has been proved to be effective under matched acoustic conditions (similar speaking st...

متن کامل

Multi-lingual speech recognition system for speech-to-speech translation

This paper describes the speech recognition module of the speech-to-speech translation system being currently developed at ATR. It is a multi-lingual large vocabulary continuous speech recognition system supporting Japanese, English and Chinese languages. A corpusbased statistical approach was adopted for the system design. The database we collected consists of more than 600 000 sentences cover...

متن کامل

Multi-lingual Fingerspelling Recognition in a Kiosk for the Handicapped

This paper presents the design and evaluation of a multi-lingual fingerspelling recognition module that is designed for an information terminal. Through the use of multimodal input and output methods, the information terminal acts as a communication medium between deaf and blind people. The system converts fingerspelled words to speech and vice versa using fingerspelling recognition, fingerspel...

متن کامل

Towards Language-Universal End-to-End Speech Recognition

Building speech recognizers in multiple languages typically involves replicating a monolingual training recipe for each language, or utilizing a multi-task learning approach where models for different languages have separate output labels but share some internal parameters. In this work, we exploit recent progress in end-to-end speech recognition to create a single multilingual speech recogniti...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

Multi-Dialectical Languages Effect on Speech Recognition

نویسندگان

چکیده

منابع مشابه

Acoustic modelling for speech recognition in Indian languages in an agricultural commodities task domain

Efficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network

Multi-lingual speech recognition system for speech-to-speech translation

Multi-lingual Fingerspelling Recognition in a Kiosk for the Handicapped

Towards Language-Universal End-to-End Speech Recognition

عنوان ژورنال:

اشتراک گذاری